Commercial skipping standalone device

ABSTRACT

Specified video segments within a video stream are identified. The identified video can represent commercials or can represent portions within the program information. The user can identify these undesired portions of the video. A signature generation unit produces a signature that indicates information about the undesirable video. The signatures are stored. Current video is compared with the signatures to identify portions of the video that correspond to the specified video segments. The identified portions are handled in a different way, for example, they can be played less prominently, or skipped. The signal indicative of the video stream can be produced at a different time than the signature generation. In addition, the signature generation can use biometric techniques.

BACKGROUND

Attempts to skip commercials in broadcast TV are well-known. Many ofthese attempts correlate over the video in order to identify portions ofthe video which are likely to represent commercials. So-called replayTVunits, available from Sonic Blue Inc., form a digital VCR whichdigitizes the incoming television and records it on a hard drive. Thedigital VCR records the signal, forms some kind of index that hasinformation for use in locating the commercials, and during playback,automatically skips these commercials.

This device may work remarkably well, however there are concerns thatsuch a device may violate copyright laws. Moreover, there are times whena user might want to skip other sections of video that are notcommercials.

SUMMARY

The present invention teaches a system in which the user controlsvarious aspects of the video identification and playback in order toidentify and later skip desired selections. The identification unit maybe totally separate from the device that actually does the recording. Inaddition, a preferred operation is responsive to user input to formspecified signatures representing the undesired video. Since the userselects which parts of the video are undesired, the user has controlover which parts of the video may be automatically skipped. Theidentification unit may index the recording by analyzing the recordingto determine likely commercials. Different embodiments of this systemare disclosed.

BRIEF DESCRIPTION OF THE DRAWINGS

These and other aspects will now be described in detail with referenceto the accompanying drawings, wherein:

FIG. 1 shows a first embodiment in which a special identification unitis used to analyze video content and skip over portions of the videocontent based on user-identified criteria.

FIG. 2 shows a second embodiment in which a remote control is suitablyprogrammed to carry out the analysis and make determinations of suitablecontent for skipping

FIG. 3 shows a flowchart of an embodiment used to determine signaturesfor identifying video sequences.

DETAILED DESCRIPTION

FIG. 1 shows a video source 100 which produces output video 102.Depending on the configuration of the device, the video 102 may beanalog or digital signals. The video source may be a conventional sourceof broadcast video such as a television tuner, or a cable or digitalcable line, satellite receiver, or the like. Video source may also be adigital tape recorder such as a replay TV type unit or TIVO, or ultimateTV type unit. The video signal 102 is input into the ID unit 110. IDunit 110 has an internal memory 112 which stores some portion of theincoming video. In one specific embodiment, the memory may be forexample 1 GB of memory, in either a miniature hard drive or in randomaccess memory, capable of storing approximately one half hour ofplayback video. Of course, the memory may be larger as desired. However,storage of one half hour of video will enable most desired commercialskipping operations.

The incoming video is coupled to the memory 112 and an A/D converterunit 114 may optionally be provided to digitize the signal in the caseof an analog input. When signal is applied to the A/D unit, itimmediately begins recording.

The ID unit also includes a signature memory 116 which may be anonvolatile memory that stores signatures indicative of known undesiredvideo segments. The signature may be very simple for example may beaverage luminance of the undesired video, or may be much more complex.Any type of signature which is known in the art to represent video orvideo segments can be used. One simplistic signature may be averageluminance. Another is described in U.S. Pat. No. 5,581,658. Othersignatures can alternatively be used; any signature that characterizesthe video signal. According to an embodiment disclosed herein, a specialsignature is recorded which may be advantageous in analyzing the contentof a commercial, although other signatures of sex scenes, violencescenes and the like may alternatively be provided.

The signatures in the signature store 116 may be changed as desired.Signature store 116 is shown connected to a network connection 122 whichenables the signatures in the unit to be updated via network. This maybe part of a subscription service or a free service on the Internet thatidentifies known commercials and/or undesired video. For example, thismay be used to identify sex scenes and/or violence scenes in knownvideos, to allow skipping over those scenes. When used in this way, thissystem may be used to edit out certain scenes during playback.

Another and more preferred way of storing the signatures is by havingthe user identify the undesired video. A special remote control 150 isprovided for use with the device 110. The remote control 150 includes aundesired video identifying (“UVI”) button at 152. When depressed, theundesired video button 152 sends a signal 154 to a correspondinginfrared receiver of conventional type 124, located within unit 110.Preferably, the user holds down the UVI button 124 for the entireduration of the undesired scene. During the time that the UVI button isbeing depressed, the unit does two things: first it sends a signal tothe signature unit 116 indicating that the incoming video representsundesired video. This incoming video is then stored for later processingto form a signature indicative of that undesired video. In addition, theunit may send a signal to the playback unit 126 which controls playbackof the stored information from memory 112. This causes the playback unitto either skip a specified period (e.g. 30 seconds), or play the videoat faster speed, for example at a double-speed or quadruple speed asconventional. Therefore, the user sees the video at faster-then-usualspeed and at the same time marks it as being an undesirable part of thevideo.

In an alternative embodiment, only the beginning of the undesiredportion is marked by pressing the UV button 152 only one time, right atthe beginning of the undesired portion. The signature formation unit 117then automatically identifies the end of the current scene (orcommercial) and automatically forms a signature.

As described above, this system may be used for skipping many kinds ofvideo content. In addition, alternative ways may be used for identifyingthe commercials. For example, a single depression of the commercialbutton may be used to identify a commercial break, and video processingtechniques may be used to determine the end of the commercial break orthe end of the scene being viewed. For example, when there is a changein the luminance of the scene by more than 10%, this may signify thatthe end of the scene is being viewed.

An advantage of this system is that the user forms and stores their ownsignatures. The user can select what parts of the video to be watchedand other parts that are not being watched. If the user desires to seesome commercials or other video, the user can see those. Alternatively,however, the user can skip other commercials by entering signatures intothe unit 116. In addition, the signature unit may include a reset button118 which may be used in order to clear the signature store and startover.

However formed, video 102 is compared with the signatures in store 118by a comparison unit 128. The comparison unit compares the incomingvideo with the signatures and produces an output signal 129 which mayindicate “skip during play”. The output signal 129 controls the playbackunit 126. Therefore, if the comparison unit 128 detects a 25 secondcommercial, it may produce a digital signal at 129 which tells theplayback unit to skip forward by 25 seconds.

In operation, a video source is applied, and automatically fills thememory 112 with video. When the user wants to watch that video,preferably at least about a half hour's worth of video storage, or aftersome certain segment of video has been entered into the playback unit126, the user selects play on the remote control, causing the playbackunit to read from the memory 112 and thereby produce the output videosignal 160. During the playback, the video is continually compared withthe signatures in unit 116 by comparison unit 128. Any match causes thisportion of the playback which is currently identified to beautomatically skipped.

Since the user sets the signatures, use of the unit may be less likelyto be considered a copyright infringement. In addition, since the IDunit 110 is separate from any recording part, it can be used with manydifferent video sources.

FIG. 2 shows an alternative embodiment in which the undesired videoskipping operation is carried out in a remote control unit whichinterfaces with a digital VCR 200 such as TIVO, TV replay type unit orthe like. In this embodiment, both a digital VCR 200, and the remotecontrol 210 include wireless network capabilities, which can be wirelessethernet such as 802.11a or 802.11b network, a Bluetooth network, or anyother type of wireless network command. Digital video recorder (“VR”),200 includes a wireless network unit 202. Remote control 210 includes awireless network part 214 which communicates with the correspondingnetwork unit 202 in digital VR 200.

In operation, the remote control 210 includes conventional buttons suchas play, fast forward and stop. Remote control 210 also includes aspecial commercial button 252. This button is depressed to identify acommercial. During playback, digital VR sends information indicative ofadvanced video which will be played from the hard drive 204 within thedigital VR over the wireless network 203 to the remote control.Therefore, the remote control receives information indicative of thevideo which will be played in the future. In one embodiment, this may bea reduced resolution version of the video, since it will only be usedfor analyzing signatures. In this embodiment, the signature storage unit220 is located in the remote control. The video which is received 216 iscompared with the signatures in the signature storage unit by comparisondevice 224. This comparison produces an index 226 which is used to drivethe playback. In this embodiment, the control of the digital VR is shownas being carried out over the wireless network, although an infraredcontrol may also be used. For example, if the signature comparer 224indicates that an undesired video clip is playing, at some future time,an entry in the index unit 226 is made indicating the time. This entryis used to tell that the digital VR to skip over the time that thecommercial or undesired video. As in the embodiment of FIG. 1, the UVIbutton 252 is used to form a signature using the signature forming unit223 to analyze the incoming video and to store the signature in thesignature storage unit 220. Again, this enables skipping any type ofundesired video and is not limited to commercials although it may beused for commercials.

As noted above, any conventional method known in the art for formingsignatures may be used for identifying the undesired portions of video.Any signal that characterizes the video may be used as a signal.However, one specifically advantageous system is shown herein. This maybe used, for example, by a processor that is processing the videostream, as shown in the flowchart of FIG. 3. At 300, an UV segment isidentified. A random number generator, which may be a software functionis then used to generate a frame number. The video is advanced by thisframe number to investigate another frame which is then analyzed. Mostcommercials will include a picture of a person within the commercial.The frame is analyzed from left corner downward to look for a picture ofperson's face which is identified as face 315. Digital informationindicative of the face is stored along with additional information aboutthe face. After storing that face information, the system continuescorrelating down from the left corner looking for a geometric object ofrelatively consistent color. The geometric object 316 is found, andinformation indicative of the geometric object (e.g., it is a ofspecified size, for example), is stored along with its position. Thisforms a signature at 320 which includes the frame number, faceinformation and position, and geometric information and position.

An advantage of this signature is that conventional face recognitionsoftware may then be used to analyze the incoming video stream torecognize the face. This face recognition software may operaterelatively quickly, and is an established technology. In addition, thegeometric information may use techniques which are known in videorendering.

This embodiment may be used to identify any biometric part, usingbiometric identification techniques.

Since the frame number is known, and the number of frames per second invideo are known (typically 30 frames per second interleaved), this canbe used to identify the beginning point of the commercial. The end pointof the commercial may also be identified using conventional techniques.

Although only a few embodiments have been disclosed in detail above,other modifications are possible for example, while the above describesskipping over the undesirable video, it should be understood that anysystem that allows it to be played less prominently, including playingthe undesirable video at a faster speed, muting the undesirable video orlowering the luminance of the undesirable video may also be used.

1. A video identification system, comprising: a first unit, that storesplural signatures characterizing segments of video that are requested tobe edited during playback; a comparing unit which operates to obtain areduced resolution version of full resolution video which will be playedat a future time, and to compare said reduced resolution version of saidvideo that will be played at a future time with said signatures toidentify at least one segment of video requested to be edited duringplayback at said future time and produces an entry indicating at leastone time of video to be skipped during playback; and a playback part,that automatically determines if current video should be skipped basedon said entry created by said comparing unit, and automatically playingsaid full resolution video that is not associated with an entryindicating that the video should be skipped during playback, and notplaying the video when said video is associated with the entryindicating that the video should be skipped during playback.
 2. A systemas in claim 1, wherein said playback part is within a remote control,and said comparing unit is physically within said remote control.
 3. Asystem as in claim 1, wherein said segments of video are less than anentire video that is being viewed, and each of a plurality of saidsignatures include a time indicative of a length of a segment of videorepresented by said each signature, and said comparing unit skips saidvideo by moving a playback forward by said time indicative of saidlength of said segment of video associated with said each signature. 4.A system as in claim 1, further comprising a network device said pluralsignatures over a network connection and periodically updating saidsignatures via the network connection, where said plural signatures thatare received over the network connection are received in a form whereeach said signature is less information than one of said segments ofvideo represented by said each signature, and where said network devicecarries out said updating to cause new signature information indicativeof new segments of video to be used by said comparing unit.
 5. A systemas in claim 1, wherein said signatures are received in a first form, andare stored in said first unit in said first form, without modifying acontent of said signatures prior to storing.
 6. A video identificationsystem, comprising: a first unit that obtains a reduced resolutionversion of video, and identifies segments of video that are requested tobe edited during playback in said reduced resolution version of video; asignature generation unit, having a control operable by a user, whichgenerates signatures characterizing said segments of video that arerequested by a user using said control to be edited during playback,where said segments of video are less than an entire video and areidentified within said reduced resolution version of the video; and acomparing unit which compares current video with said signatures toidentify said segments of said video requested to be edited duringplayback, wherein said video includes program information and alsoincludes commercials, and wherein said segment of video that isrequested by said user using said control to be edited during playbackrepresents at least one portion of the program information that is notcommercials and that represents at least one of violence scenes and/orsex scenes in said video, and where at least a portion of said videoother than said segments of video that are requested to be editingduring playback are allowed to be played and are played using a fullresolution version of the video, and parts which are edited duringplayback are automatically prevented from being played.
 7. A system asin claim 6, wherein said signature generation unit also stores a time ofeach of a plurality of stored signatures.
 8. A system as in claim 6,wherein said signature accessing unit includes a signature thatrepresents a face within a video stream, and a location of said facewithin said video stream relative to an object in said video stream andsaid comparing unit includes a face recognition unit that looks for saidface within said video stream at said location relative to said object.9. A video identification system, comprising: a first unit that obtainsa reduced resolution version of video, and identifies segments of videothat are requested to be edited during playback in said reducedresolution version; a signature access unit which accesses at least onesignature characterizing said segments of video that are requested to beedited during playback; and a comparing unit which compares said reducedresolution version with said signature to automatically identifyportions of the video requested to be edited during playback, whereinsaid video includes program information and also includes commercials,and wherein said segment of video that is requested to be edited duringplayback represents at least one portion of the program information thatis not commercials and that represents at least one of violence scenesand/or sex scenes in video, wherein said parts that are not editedduring playback are full resolution versions of the video.
 10. A systemas in claim 9, further comprising a remote control, and wherein saidsignature access unit and comparing unit are physically within a housingthat holds the remote control, wherein said remote control has a controlthat indicates that currently viewed video represents video requested tobe edited during playback when said control is actuated, and produces asignal, sent to said signature accessing unit, indicative of a selectionof video segment that are requested to be edited during playback.
 11. Asystem as in claim 10, wherein said signature accessing unit isresponsive to said signal that is produced by actuation of said controlto produce a signature indicative of said video.
 12. A method,comprising: operating to obtain a reduced resolution version of fullresolution video which will be played at a future time, and to analyzethe reduced resolution version of said full resolution video; forming asignal indicative of a portion of a first video stream in said reducedresolution version, which signal includes information indicative of abiometric part within said video stream and a location of said biometricpart within said video stream relative to an object within said videostream; and using said signal, including said information indicative ofsaid biometric part, with a computer that is programmed to automaticallyrecognize said biometric part and said location of said biometric partrelative to said object from said signal in a second video stream whichis a full resolution video stream, said comparing operating to comparesaid reduced resolution version of said video that will be played at afuture time with said information indicative of said biometric part toidentify at least one segment of video requested to be edited duringplayback at said future time and produces an entry indicating at leastone time of video to be skipped during playback; and based on said usingto recognize said biometric part, altering playing of at least a part ofthe full resolution video stream in which the biometric part wasrecognized based on said entry.
 13. A method as in claim 12, whereinsaid forming occurs at a first time, and said using occurs at a secondtime, subsequent to said first time.
 14. A method as in claim 12,wherein said signal indicative of said first video stream includes anindex signal indicative of times of the portion including at least atime that the video stream takes to play.
 15. A method as in claim 14,wherein said altering comprises skipping forward within a video segmentby said time that the portion takes to play.
 16. A method as in claim12, wherein said computer is programmed to recognize a geometric objectwithin said video stream, and said location is a location relative tosaid geometric object.
 17. A system comprising: a stored video unit,which stores a reduced resolution version of video to be played, andalso stores identifying information from which sections of said videocan be identified during playing; and said video unit including aninterface that receives commands from a remote control unit, wherein aplaying command controls playing of said video, a memory, storingidentifying information identifying multiple different sections ofvideo, said video unit recognizing, other sections of video which aresimilar to said section of video based on said identifying information;a comparing unit which operates at a current time to compare saidreduced resolution version of said video to be played with saididentifying information to identify at least one segment of the videorequested to be edited during playback and produces an entry indicatingat least one segment of video to be skipped during playback; and aplayback part, that receives current full resolution video to be played,automatically determines if the current video should be skipped based onsaid entry created by said comparing unit, and playing said fullresolution video that is not associated with an entry indicating thatthe video should be skipped during playback, and does not play the videowhen said video is associated with the entry indicating that the videoshould be skipped during playback.
 18. A system as in claim 17, whereinsaid identifying information includes information indicative of abiometric part within said video and a location of said biometric partwithin said video.
 19. A system as in claim 17, wherein said videoincludes program information, and commercial information that isseparate from said program information, and wherein at least one item ofsaid identifying information identifies a part of said programinformation that represents at least one of a sex scene or a violencescene and not said commercial information.
 20. A system as in claim 17,wherein said sections of said video which are similar includes acommercial.
 21. A system as in claim 17, wherein said remote controlunit includes a wireless network protocol, which communicates with saidstored video unit.
 22. A system as in claim 17, further comprising anetwork connection, receiving said identifying information over anetwork connection, storing the identifying information, andperiodically updating said identifying information via the Internet,where said identifying information that is received over the networkconnection is received as less than an entire video portion representedby the identifying information, and where said updating causes newidentifying information indicative of new sections of video to be readby said video unit for recognizing said sections of video.
 23. A systemas in claim 17, wherein said identifying information is received in afirst form, and are stored in said memory in said first form, withoutmodifying a content of said identifying information prior to storing.24. A method, comprising: reading a reduced resolution informationindicative of playing a video stream, said information including bothprogram information parts within the video stream and commercial partswithin the video stream; reading segment information that is indicativeof portions of said program information parts and not said commercialparts, within said video stream, wherein said segment informationincludes information indicative of at least one of violence scenes andor sex scenes in the program information parts; comparing, at a currenttime, portions of said reduced resolution information with said segmentinformation to identify at least one segment of the video requested tobe edited during playback at said future time and produces an entryindicating at least one time of video to be skipped during playback atsaid future time; playing a full resolution video stream, andautomatically recognizing portions of the full resolution video streamindicated by said segment information as recognized portions based onsaid entry created by said comparing, and automatically playing parts ofsaid full resolution video stream that is not associated with an entryindicating that the video should be skipped during playback, and notplaying the full resolution video when said video is associated with theentry indicating that the video should be skipped during playback; andproducing an output from said playing, including playing video fromportions of said video stream that are not recognized by saidrecognizing, and a different operation for portions of said video streamthat are recognized by said recognizing.
 25. A method as in claim 24,further comprising allowing a user to identify parts of said video, andproducing said segment information, and wherein said allowing occurs ata first time, and said producing occurs at a second time, subsequent tosaid first time.
 26. A method as in claim 24, wherein said differentoperation comprises skipping said portions of the video streamrecognized by said recognizing.
 27. A method as in claim 24, furthercomprising receiving said segment information over a network connection.28. A method as in claim 24, further comprising periodically updatingsaid segment information via the Internet, such that new segmentinformation indicative of new portions of said program information partsare read by said reading segment information.
 29. A method as in claim24, wherein said segment information includes information thatrepresents a face within a video stream, and a location of said facewithin said video stream relative to an object within said video streamand said recognizing looks for said face within said video stream atsaid location relative to said object.
 30. A method as in claim 29,wherein said object is a geometric object within said video stream, andsaid location is a location relative to said geometric object.